- 
                Notifications
    You must be signed in to change notification settings 
- Fork 114
[Port] Have added Wilson's algorithm and loop erased random walks #24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
| Codecov Report❌ Patch coverage is  Additional details and impacted files@@            Coverage Diff             @@
##           master      #24      +/-   ##
==========================================
- Coverage   99.45%   99.44%   -0.02%     
==========================================
  Files         106      107       +1     
  Lines        5554     5602      +48     
==========================================
+ Hits         5524     5571      +47     
- Misses         30       31       +1     🚀 New features to boost your workflow:
 | 
| @nassarhuda, are you familiar with Wilson's algorithm? Could you take a look here? | 
| function wilson_rst end | ||
| @traitfn function wilson_rst(g::AG::(!IsDirected), | ||
| distmx::AbstractMatrix{T}=weights(g); | ||
| seed::Int=-1, | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we really need a seed, when we can pass in a random generator?
| tree = wilson_rst(g) | ||
| probability_of_tree = prod([weights(g)[src(e), dst(e)] for e in tree]) | ||
| probability_of_tree /= det(laplacian_matrix(g)[2:nv(g), 2:nv(g)])) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we use a julia or jldoctest block here?
| start1 = rand(rng, 1:nv(g)) | ||
| start2 = rand(rng, 1:nv(g)) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| start1 = rand(rng, 1:nv(g)) | |
| start2 = rand(rng, 1:nv(g)) | |
| start1 = rand(rng, vertices(g)) | |
| start2 = rand(rng, vertices(g)) | 
might make more sense, although at least for SimpleGraph, this will not be of type Int but of type eltype(g)
| seed::Int=-1, | ||
| rng::AbstractRNG=GLOBAL_RNG | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seem here, do we need seed?
| f::Union{Set{V},Vector{V}}=Set{Integer}(), | ||
| seed::Int=-1, | ||
| rng::AbstractRNG=GLOBAL_RNG | ||
| )::Vector{Int} where {T <: Real, U, AG <: AbstractGraph{U}, V <: Integer} | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If I remember ::SomeType after a function does actually force convert to be used on the result of that function.
| @etiennedeg @gjh6 I think this is something useful to have. Given that this PR has been lying dormant for quite a while, there might be some things that you would do differently? Then maybe it is better, if you would refactor this before me or someone else continues with reviewing. | 
| rng = getRNG(seed) | ||
| end | ||
|  | ||
| visited = Vector{Integer}(undef, 1) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Vector should be typed with eltype(g) (passed by dynamic dispatch)
| i += 1 | ||
| end | ||
|  | ||
| length(f) == 0 || visited_view[cur_pos] in f || throw(ErrorException("termiating set was not reached")) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| length(f) == 0 || visited_view[cur_pos] in f || throw(ErrorException("termiating set was not reached")) | |
| length(f) == 0 || visited_view[cur_pos] in f || throw(ErrorException("terminating set was not reached")) | 
| break | ||
| end | ||
| nbrs = neighbors(g, cur) | ||
| length(nbrs) == 0 && break | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we do this check only for the first vertex? Graph is undirected so if we move to an adjacent neighbor, this vertex must have at least one neighbor ?
| end | ||
| nbrs = neighbors(g, cur) | ||
| length(nbrs) == 0 && break | ||
| wght = [distmx[cur, n] for n in nbrs] | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| wght = [distmx[cur, n] for n in nbrs] | |
| wght = @view distmx[cur, nbrs] | 
Maybe this is better?
| if v in visited_view | ||
| cur_pos = indexin(v, visited_view)[1] | ||
| visited_view = view(visited, 1:cur_pos) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| if v in visited_view | |
| cur_pos = indexin(v, visited_view)[1] | |
| visited_view = view(visited, 1:cur_pos) | |
| new_cur_pos = findfirst(v, visited_view) | |
| if !isnothing(new_cur_pos) | |
| cur_pos = new_cur_pos | |
| visited_view = view(visited, 1:cur_pos) | |
| end | 
| end | ||
|  | ||
| visited = Vector{Integer}(undef, 1) | ||
| visited_view = view(visited, 1:1) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it necessary to carry a view? The only place where it is used is line 164-165 (and only in one line with my proposed change). The indexing of visited_view is the same as visited because it always start at the first index of visited, so in line 152, 156 and 178, we can replace visited_view by visited.
| end | ||
|  | ||
| visited_vertices = Set(walk) | ||
| unvisited_vertices = setdiff(Set([i for i = 1:nv(g)]), visited_vertices) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is more efficient to just flag visited vertices (with a linear pass over walk) than to do complex set operations (which I think are not even linear...). We just maintain a BitVector is_visitedof size nv(g).
In loop_erased_randomwalk, e in f becomes is_visited[e], which is also more efficient.
|  | ||
| walk = loop_erased_randomwalk(g, start1, distmx=distmx, f=[start2], rng=rng) | ||
|  | ||
| tree = SimpleGraph(nv(g)) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tree is only used to together with add_edge!, which is a costly operation. It is more efficient to directly store the unique edges.
| end | ||
|  | ||
| visited_vertices = Set(walk) | ||
| unvisited_vertices = setdiff(Set([i for i = 1:nv(g)]), visited_vertices) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| unvisited_vertices = setdiff(Set([i for i = 1:nv(g)]), visited_vertices) | |
| unvisited_vertices = setdiff(Set(vertices(g)), visited_vertices) | 
this is a port of #1576